Controlling centrality in complex networks 
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Spectral centrality measures allow to identify influential individuals in social groups, to rank Web 
pages by their popularity, and even to determine the impact of scientific researches. The centrality 
score of a node within a network crucially depends on the entire pattern of connections, so that 
the usual approach is to compute the node centralities once the network structure is assigned. We 
face here with the inverse problem, that is, we study how to modify the centrality scores of the 
nodes by acting on the structure of a given network. We prove that there exist particular subsets 
of nodes, called controlling sets, which can assign any prescribed set of centrality values to all the 
nodes of a graph, by cooperatively tuning the weights of their out-going links. We show that many 
large networks from the real world have surprisingly small controlling sets, containing even less than 

5 — 10% of the nodes. These results suggest that rankings obtained from spectral centrality measures 
have to be considered with extreme care, since they can be easily controlled and even manipulated 
by a small group of nodes acting in a coordinate way. 

PACS numbers: 89.75.Hc,89.75.-k,89.75.Fb 



Modelling social, biological and information- 
technology systems as complex networks has proven to 
be a successful approach to understand their function 
Among the various aspects of networks which 
have been investigated so far, the issue of centrality, and 
the related problem of identifying the central elements 
in a network, has remained pivotal since its first intro- 
duction. The idea of centrality was initially proposed 
in the context of social systems, where it was assumed 
a relation between the location of an individual in the 
network and its influence and power in group processes 
@, @. Since then, various centrality measures have been 
introduced over the years to rank the nodes of a graph 
according to their topological importance. Centrality 
has found many applications in social systems |6|, in 
biology @ and in man-made spatial networks (8l4lQj. 

Among the various measures of centrality, such as 
those based on counting the first neighbours of a node 
(degree centrality), or the number of shortest paths pass- 
ing through a node (betweenness centrality) [U, [l2| , a 
particularly important class of measures are those based 
on the spectral properties of the graph [l3j |. Spec- 
tral centrality measures include the eigenvector centrality 
[Hi EH, the alpha centrality [16], Katz's centrality fig 
and PageRank [l8[, and are often associated to simple 
dynamics taking place over the network, such as various 
kinds of random walks [l9l-[2lj. As representative of the 
class of spectral centralities, we focus here on eigenvec- 
tor centrality, which is based on the idea that the impor- 
tance of a node is recursively related to the importance of 
the nodes pointing to it. Given an unweighted directed 
graph G = (V,E) with TV = \V\ nodes and K = \E\ 
links, described by the N x N adjacency matrix A, the 
eigenvector centrality Co of G is defined as the eigenvec- 
tor of A 1 associated to the largest eigenvalue po, which 



in formula reads A 1 cq — poc [l4]4l(|. If the graph is 
strongly connected, then the Perron-Frobenius theorem 
guarantees that Co is unique and positive. Therefore, cq 
can be normalised such that the sum of the components 
equals 1, and the value of the z-th component represents 
the centrality score of node z, i.e. the fraction of the 
total centrality associated to node i. In this Article we 
show how to change the eigenvector centrality scores of 
all the nodes of a graph by performing only local changes 
at node level. As a first step (see the Methods Section) 
we have proved that, given any arbitrary positive vector 
c G R^, c > 0, and c ^ c , it is always possible to assign 
the weights of all the links of a strongly-connected graph 
G and to construct a new weighted network G u , with the 
same topology as G and with eigenvector centrality equal 
to c: 

Aic = pc, (1) 

where A u is the weighted adjacency matrix of G w . 

This is illustrated in Fig. [1] for a graph with N = 4 
nodes and K = 5 links. In the original unweighted graph 
G, node 2 is the node with the highest eigenvector cen- 
trality, followed in order by node 3, node 4, and node 
1. Now, if we have the possibility of tuning the weights 
of each of the five links, we can set any centrality value 
to the nodes of the graph. In figure we show, for in- 
stance, how to fix the weights of the five links in order to 
construct: i) a weighted network G u in which all nodes 
have the same centrality score, and ii) even a weighted 
network G u in which the centrality ranking is totally re- 
versed with respect to the ranking in G. 

As shown in the example, given a graph G, by control- 
ling the weights of all the links, it is always possible to 
set any arbitrary vector c as the eigenvector centrality of 
the graph. However, tuning the weights of all the K links 
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FIG. 1. An example of how to tune the link weights to change the node centrality scores. The graph G with N — 4 
nodes and K — 5 links shown in panel (a) is strongly-connected and has an eigenvector centrality Co = {0.18, 0.33, 0.27, 0.22}. 
By ranking the nodes according to the components of Co, we obtain that node 2 is the most central one, followed in order by 
node 3, node 4, and node 1. We can now set the weights of the five links u? = {ui, U2, ui3, ^4, ^5} in such a way that Eq[T]is 
satisfied for any given centrality vector c / Co. For instance, we can get a weighted network G u in which all nodes have the 
same centrality, by solving Eq[T]with a centrality vector c = {1/4, 1/4, 1/4, 1/4} and p = 3.0. We obtain a vector of weights: 
uj — {a, 3, 3,3,3 — a} which, for < a < 3, guarantees that all the link weights of the graph are positive. The resulting 
network G u is shown in panel (b). As expected, we have K — N = 1 free parameter (namely a) since the graph has N = 4 
nodes and K — 5 links. Instead, if we want to reverse the original node ranking we can solve the system A^c — pc with a 
centrality vector c = {0.5, 0.05, 0.2, 0.25}. Notice that, in this case, the ranking induced by c is exactly the opposite of the one 
induced by Co: now node 1 is the most central one, followed in order by node 4, node 3, and node 2. The solution of Eq|2] gives 
uj — {a, 12, 15/4, 6, (3 — 10a)/5}, corresponding to a weighted network G w with all positive weights whenever < a < 3/10. 
The resulting network is shown in panel (c). 



of a given network is practically unfeasible, especially in 
large systems. Fortunately, this is not necessary, either. 
In fact, in the case of Fig. [TJ a weighted graph with all 
nodes having the same centrality score can also be ob- 
tained by changing the weights of only four links, while 
leaving unchanged the weight of the link from node 1 to 
node 2. More in general, it can be proved that the eigen- 
vector centrality of the whole network can be controlled 
by appropriately tuning the weights of just N of the K 
links. The only constraint is that the TV links must be- 
long to a subset E' C E such that, for every node i G V, 
there is a link £ G E' pointing to i (see Methods Section). 

This is illustrated in Fig. [2] for three real social net- 
works. In each of the three cases, it is possible to set 
any arbitrary eigenvector centrality by changing only the 
weights of the red arcs, while keeping unchanged (and 
equal to 1) the weights of all remaining arcs, shown in yel- 
low. The nodes from which the links in E' originate are 
also coloured in red, and are referred to as a controlling 
set of the network (see Methods Section). What is strik- 
ing is that, in each of the three networks, the set E' can be 
chosen in such a way that all the links in E' originate from 
a relatively small subset of nodes. For instance, the con- 
trolling set reported for the student government network 
of the University of Ljubljana contains only two nodes. 
This is also a minimum controlling set, since the graph 
does not admit another controlling set with a smaller 
number of nodes. This finding indicates that only two 
members of the student government, namely node 2 and 
node 8, can in principle set the centrality of all the other 
members by concurrently modifying the weights of some 
of their links. It is in fact reasonable to assume that the 
weight of the directed link from i to j, representing in 
this case the social credit (in terms of reputation, esteem 
or leadership acknowledgement) given by individual i to 
individual j, can be strengthen or decreased only by i. 



Consequently, nodes 2 and 8 can modify at their will the 
weights of their out-going links and, If these changes are 
opportunely coordinated, they can largely alter the ac- 
tual roles of all the other individuals. Analogously, only 
five monks can control the centrality of the Sampson's 
monk network, while only 4 members of the Zachary's 
karate club network can set the eigenvector centrality of 
the remaining 30 members. 

A question of practical interest is to investigate the 
size C = I C* I of the minimum controlling set in vari- 
ous complex networks. When C is small with respect to 
TV, then the centrality of the network is easy to control. 
Conversely, when the number of nodes in the minimum 
controlling set is large, the network G is more robust with 
respect to centrality manipulations. We have used two 
greedy algorithms to compute approximations of mini- 
mum controlling sets in various real systems (see Meth- 
ods Section). In Table [I] we report the best approximation 
for C, i.e. the size of the smallest controlling set C pro- 
duced by either of the two algorithms in networks whose 
sizes range from hundreds to millions of nodes. In the 
majority of the cases we have found unexpectedly small 
controlling sets, containing only up to 10 — 20% of the 
nodes of the network. For instance, in the graph of Jazz 
musicians, there exists a controlling set made by just 16 
of the 198 musicians. These 16 individuals alone can, in 
principle, decide to set the popularity of all the other mu- 
sicians, enhancing the centrality of some of the nodes and 
decreasing the centrality of others, just by playing more 
or less often with some of their first neighbours. Among 
all the networks we have considered, the one with the 
smallest controlling set is the Wikipedia talk communi- 
cation network, a graph with 2,394,385 nodes in which 
just 2% of nodes are able to alter the centrality of the en- 
tire system. The quantities in parenthesis indicate that 



FIG. 2. Minimum controlling sets in three real social networks. The graph in panel (a), with N = 11 vertices and 
K = 41 arcs, shows who asks who for an opinion among the members of the student government of the University of Ljubljana 
in 1992 [22]. The minimum controlling set is made by the two nodes marked in red, namely node 2 and node 8. These two 
nodes are linked to each other and point to all the remaining nodes in the graph. Therefore, nodes 2 and 8, by cooperatively 
modifying the weights of their red links, can set any arbitrary eigenvector centrality to the entire system. The graph in panel 
(b) has N = 18 nodes and K — 55 arcs, and describes the social relations between the monks of an isolated contemporary 
American monastery, as recorded by Sampson in 1969 [23]. Here, the minimum controlling set contains five nodes, shown 
in red. In this case, the subset of links E' (red links), does not contain links pointing to node 5, so that the red nodes can 
control the centrality of all the network nodes, except node 5. Finally, the Zachary's karate club network shown in panel (c) 
has N = 34 nodes and K — 78 undirected edges, and describes the social network of friendships among the members of a US 
university karate club in 1970 [24| • In this network, the minimum controlling set contains node 1, the instructor Mr. Hi, node 
34, the club administrator Mr. John A, and also nodes 7 and 26. Notice that just two nodes, namely 1 and 34, can control the 
centrality of 95% of the graph nodes. 



for this network a set of just 1% of the nodes can control 
the centrality of 99% of the nodes. 

For each real network G, we have also computed the 
typical size C(G rnd ) of the minimum controlling set in 
its randomised counterpart (see the rightmost column in 
Table H]). In particular, we have considered a randomisa- 
tion which preserves the degree sequence of the original 
graph. In most of the cases C(G) < C(G rnd ), relevant 
exceptions being some spatial man-made networks, such 
as power grids, road networks and electronic circuits, 
and also the patents citation network. This fact suggests 
that, in the absence of other limitations, such as strong 



spatial/geographic constraints [Tpj , the structure of real 
networks has naturally evolved to favour the control of 
spectral centrality by a small group of nodes. To bet- 
ter compare the controllability of networks with different 
sizes, we report in Fig.[3]the ratio r( ^ r l d ^ as a function of 
the number of graph nodes N. The smallest values of the 
ratio c(g™<*) are f oun d for collaboration/communication 
systems, WWW and socio-economical networks. The five 
most controllable networks are respectively Wiki-talk, In- 
ternet at the AS level, movie actors, the Stanford World 
Wide Web, and the collaboration network of researchers 
in astrophysics. These are all networks in which single 
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TABLE I. Number of nodes N, average degree (k), and the relative size C(G) of the mimimum controlling set found in 35 
different real world networks. The values of C(G) reported are expressed as percentage of the network size N. The algorithms 
used to find approximations of minimum controlling sets, mark as controllers also nodes not controlling other nodes, simply 
because, at a certain iteration of the greedy procedure, they have remained with no out-going links. Therefore, we also report in 
parenthesis the relative size of the effective minimum controlling set and the percentage of the controlled nodes. The notation 
x% — > y%, indicates that x% of the nodes is able to control the centrality of y% of the network. In the rightmost column 
we report, for each network, the relative size of the controlling set in randomized versions which preserve the original degree 
sequence. We have considered averages over 100 different randomizations. From top to bottom, the networks are divided into six 
classes, respectively World- Wide- Web, collaboration/communication, citation, spatial, words and socio-economical networks. 



nodes can tune, at their will, the weights of their out- 
going links. A scientist can decide whether to weaken or 
strengthen the connections to some of the collaborators. 
The administrators of an Internet Autonomous System 
can control the routing of traffic through neighbouring 
ASs, by modifying peering agreements [26|. And, simi- 
larly, the owner of a Web page can change the weights of 
hyperlinks, for instance by assigning them different sizes, 
colour, shapes and positions in the Web page. 

In this work, we have shown how a small number of en- 
tities, working cooperatively, can set any arbitrary eigen- 
vector centrality for all the nodes of a real complex net- 



work. It is straightforward to extend our results to other 
spectral centralities, such as a-centrality and Katz's cen- 
trality. Similar arguments can also be applied, with some 
limitations, to PageRank: in this case, the inverse cen- 
trality problem has solutions only for some particular 
choices of c. Such findings suggest that rankings obtained 
from centrality measures should be taken into account 
with extreme care, since they can be easily controlled 
and even distorted by a small group of cooperating nodes. 
The high controllability of real networks potentially has 
large social and commercial impact, given that centrality 
measures are nowadays extensively used to identify key 
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FIG. 3. Relative size of the minimum controlling set 
in various real systems. We report, as a function of N, 
the ratio between the sizes of the minimum controlling set 
in real networks and in their respective randomized versions 
(we have considered averages over 100 different realizations). 
Different symbols and colors refer to the six network classes 
considered in Table [I] The observed ratio is lower than 1 
in most of the cases, with the smallest values corresponding 
to collaboration/communication systems, WWW and socio- 
economical networks. The ratio is equal to 1 in three cases. 
The networks with ratio larger than 1, with the exception of 
one socio-economical system (namely Epinions), are all spa- 
tially constrained systems: three electronic circuits, the US 
power grid, and three road networks. 



actors, to rank Web pages, and also to assess the value 
of a scientific research. 



I. METHODS 

A. Solution to the inverse centrality problem. 

The set of N linear equations with K variable weights, 
cji, . . . , uo k-, in Eq. [T] can be rewritten as a system of N 
linear equations with K variables: 

Bu> = pc, (2) 

where now B is a TV x K matrix of real numbers, and 
u) = {lji, . . . , ujk}- Notice that the linear system in Eq.[2] 
has solutions since the rank of B is N < K (all the equa- 
tions are separated and each of the variables, cji, . . . , uor-, 
appears in one equation only), and the in-degree of all 
nodes is positive by definition. Hence, there always ex- 
ists u G M x such that Eq. [T]is satisfied. It is convenient 
to rewrite Eq. [2] in a form that emphasises the depen- 
dence of matrix B from c. We choose to label the arcs as 
follows: (i, Z), I = 1 . . . kl n denotes the Z-th arc entering 
node z, where kf 1 is the in-degree of node i. Likewise, 
Sij is the source of arc (i, Z), while oj^i is the correspond- 
ing weight. Using this notation, the i-th component of 



Eq. [2] can be written as: 

k T 

y^Ui,ic SiJ = pci (3) 
1=1 

By direct computation, one positive solution of Eq. [3] is 
given by 

pCi . N 

0Ji,l = 0Ji,2 = ■■■= U ijk in = — — (4) 

Ez=i 

where i = 1 ... TV, and by continuity there are infinite 
many solutions such that ojij are all positive. In particu- 
lar, if for node i we have kf 1 = 1, then the i-th equation 
of Eq. [3] has a unique solution, while if k 1 / 1 > 1, there are 
always infinitely many solutions depending on k 1 / 1 — 1 
parameters. Summing up, Eq. (j2j) has only one solution 
if all the node in-degrees are equal to one, while there 
are, in general, infinitely many solutions depending on 
K — N parameters. Notice that p can be different from 
po, meaning that it is also possible to set the value of the 
largest eigenvalue of the weighted graph. 

B. Tuning a subset of the graph links. 

Here, we show that it is not necessary to fix the weights 
of all the graph links in order to get an arbitrary central- 
ity vector c > 0. In fact, given a subset of links E' C E 
containing at least one incoming link for each node, it 
is sufficient to assign some positive weights &{t!) to each 
i' G E', while keeping uo(l) constant W G E \ E' ', for 
instance all equal to 1, such that the resulting weighted 
graph has eigenvector centrality equal to c. Without loss 
of generality we can assume that the first k\ > incoming 
links of each node i belong to E' \ so that the components 
of Eq. [3] can be written as: 

y^UJjjC Sia = pCj - ^ ^i,lCSi, u 2 = 1... TV (5) 

1=1 1=^+1 

Therefore, since q > for each 1 < i < TV, then there is 
a po > such that for every p > po 

pci - ^2 Vijcsij > 0, i = 1 ... N (6) 

l=k^ + l 

and hence, by a similar continuity argument as above, we 
can ensure that there are infinitely many positive solu- 
tions to Eql5l 

C. Finding minimum controlling sets. 

A controlling set of graph G is any set of nodes C C V 
such that: 

V = Cu(\J{jeV: etjEE}]. (7) 
\iec I 
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This means that, for each node j in the graph, at least 
one of the two following conditions holds: a) j G C, or 
b) j is pointed by at least one node in C. We use \C\ to 
denote the size of the controlling set, i.e. the number of 
nodes contained in C. Finding the minimum controlling 
set C* of a graph G, i.e. a controlling set having min- 
imal size, is equivalent to computing the so-called dom- 
ination number of G. The domination number problem 
is a well known NP-hard problem in graph theory (25| . 
Therefore, the size of the minimum controlling set can 
be determined exactly only for small N graphs as those 
in Fig. [21 To investigate larger graphs we have used two 
greedy algorithms. The first algorithm, called Top-Down 
Controller Search (TDCS), works as follows. We initially 
set Gt=o = G. We select the node io with the maximum 
out-degree in G t =o, and mark it as controller node. Then, 
all the nodes in the out-neighbourhood of io are marked 
as controlled and are removed from Gt=o, together with 
io itself. In this way, we obtain a new graph Gt=i, and 
we store the controller node io, together with the list of 
nodes controlled by io. Notice that, removing a generic 
node j from Gt=o 5 a l so implies that Gt=i does not con- 
tain any of the links pointing to j or originating from it. 
The same procedure is iteratively applied to G?t=i, Gt=2 
and so on, until all the nodes of G are either marked 
as controller or as controlled nodes. The algorithm pro- 
duces a set C = {io, U,^2, • • •}, with \C\ > |C*|, which 



is a controlling set of G by construction. The second al- 
gorithm is called Bottom-Up Controller Search (BUCS), 
and it works as follows. We set Gt=o = G and consider 
the set M(0) containing all the nodes in Gt=o with min- 
imum in-degree. For each node i G M(0), we consider 
the set of nodes pointing to i and select from this set 
the node mi with the maximal out-degree. This node 
is marked as controller. Then we obtain a new graph 
Gt=i by removing from Gt=o all the controller nodes mi 
for all i G M(0), together with all the nodes, marked as 
controlled, pointed by them. The same procedure is it- 
eratively applied to Gt=i, G t =2 and so on, until all the 
nodes of G are either marked as controller or as controlled 
nodes. If a graph G t contains isolated nodes, these are 
marked as controller and removed from Gt- The algo- 
rithm finally produces a set C = {io, ii, 22, • • •} which 
is a controlling set of G by construction. We have ver- 
ified that the controlling sets obtained by both TDCS 
and BUCS for each of the networks considered are much 
smaller than those obtained by randomly selecting the 
controlling nodes. Moreover, the set of controller nodes 
found by TDCS is in general different from that obtained 
on the same network by BUCS. Also the sizes of the two 
controlling sets obtained by the two algorithms are dif- 
ferent. In particular, we have noticed that in assort at ive 
(disassortative) networks the controlling set produced by 
TDCS is smaller (larger) than that produced by BUCS. 
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